1,000 research outputs found
Exploring semantic information in disease: Simple Data Augmentation Techniques for Chinese Disease Normalization
The disease is a core concept in the medical field, and the task of
normalizing disease names is the basis of all disease-related tasks. However,
due to the multi-axis and multi-grain nature of disease names, incorrect
information is often injected and harms the performance when using general text
data augmentation techniques. To address the above problem, we propose a set of
data augmentation techniques that work together as an augmented training task
for disease normalization. Our data augmentation methods are based on both the
clinical disease corpus and standard disease corpus derived from ICD-10 coding.
Extensive experiments are conducted to show the effectiveness of our proposed
methods. The results demonstrate that our methods can have up to 3\%
performance gain compared to non-augmented counterparts, and they can work even
better on smaller datasets
Bayesian Optimization with Hidden Constraints via Latent Decision Models
Bayesian optimization (BO) has emerged as a potent tool for addressing
intricate decision-making challenges, especially in public policy domains such
as police districting. However, its broader application in public policymaking
is hindered by the complexity of defining feasible regions and the
high-dimensionality of decisions. This paper introduces the Hidden-Constrained
Latent Space Bayesian Optimization (HC-LSBO), a novel BO method integrated with
a latent decision model. This approach leverages a variational autoencoder to
learn the distribution of feasible decisions, enabling a two-way mapping
between the original decision space and a lower-dimensional latent space. By
doing so, HC-LSBO captures the nuances of hidden constraints inherent in public
policymaking, allowing for optimization in the latent space while evaluating
objectives in the original space. We validate our method through numerical
experiments on both synthetic and real data sets, with a specific focus on
large-scale police districting problems in Atlanta, Georgia. Our results reveal
that HC-LSBO offers notable improvements in performance and efficiency compared
to the baselines.Comment: 8 pages, 8 figures (exclude appendix
Towards Visually Explaining Variational Autoencoders
Recent advances in Convolutional Neural Network (CNN) model interpretability
have led to impressive progress in visualizing and understanding model
predictions. In particular, gradient-based visual attention methods have driven
much recent effort in using visual attention maps as a means for visual
explanations. A key problem, however, is these methods are designed for
classification and categorization tasks, and their extension to explaining
generative models, e.g. variational autoencoders (VAE) is not trivial. In this
work, we take a step towards bridging this crucial gap, proposing the first
technique to visually explain VAEs by means of gradient-based attention. We
present methods to generate visual attention from the learned latent space, and
also demonstrate such attention explanations serve more than just explaining
VAE predictions. We show how these attention maps can be used to localize
anomalies in images, demonstrating state-of-the-art performance on the MVTec-AD
dataset. We also show how they can be infused into model training, helping
bootstrap the VAE into learning improved latent space disentanglement,
demonstrated on the Dsprites dataset
Two-Stage Submodular Optimization of Dynamic Thermal Rating for Risk Mitigation Considering Placement and Operation Schedule
Cascading failure causes a major risk to society currently. To effectively
mitigate the risk, dynamic thermal rating (DTR) technique can be applied as a
cost-effective strategy to exploit potential transmission capability. From the
perspectives of service life and Braess paradox, it is important and
challenging to jointly optimize the DTR placement and operation schedule for
changing system state, which is a two-stage combinatorial problem with only
discrete variables, suffering from no approximation guarantee and dimension
curse only based on traditional models. Thus, the present work proposes a novel
two-stage submodular optimization (TSSO) of DTR for risk mitigation considering
placement and operation schedule. Specifically, it optimizes DTR placement with
proper redundancy in first stage, and then determines the corresponding DTR
operation for each system state in second stage. Under the condition of the
Markov and submodular features in sub-function of risk mitigation, the
submodularity of total objective function of TSSO can be proven for the first
time. Based on this, a state-of-the-art efficient solving algorithm is
developed that can provide a better approximation guarantee than previous
studies by coordinating the separate curvature and error form. The performance
of the proposed algorithms is verified by case results
- …